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Abstract 

In  this  paper  we  construct  robust  models  for  active  portfolio  management  in  a  market  with 
transaction  costs.  The  goal  of  these  robust  models  is  to  control  the  impact  of  estimation  errors 
in  the  values  of  the  market  parameters  on  the  performance  of  the  portfolio  strategy.  Our  models 
can  handle  a  large  class  of  piecewise  convex  transaction  cost  functions  and  allow  one  to  impose 
additional  side  constraints  such  as  bounds  on  the  portfolio  holdings,  constraints  on  the  portfolio 
beta,  and  limits  on  cash  and  industry  exposure.  We  show  that  the  optimal  portfolios  can  be 
computed  by  solving  second-order  cone  programs  -  a  class  of  optimization  problems  with  a  worst 
case  complexity  (i.e.,  cost)  that  is  comparable  to  that  for  solving  convex  quadratic  programs 
(e.g.  the  Markowitz  portfolio  selection  problem).  We  tested  our  robust  strategies  on  simulated 
data  and  on  real  market  data  from  2000-2003  imposing  realistic  transaction  costs.  In  these  tests, 
the  proposed  robust  active  portfolio  management  strategies  significantly  outperformed  the  S&P 
500  index  without  a  significant  increase  in  volatility. 


1  Introduction 

Portfolio  management  is  concerned  with  allocating  capital  over  a  number  assets  to  maximize  a 
suitably  defined  measure  of  “return”  and  minimize  “risk”.  Although  the  role  of  diversification  in 
reducing  risk  has  long  been  a  part  of  financial  folklore,  Markowitz  [26,  27]  formulated  the  first 
mathematical  model  that  explicitly  modeled  the  risk-return  trade-off.  In  the  Markowitz  model  the 
“return”  of  a  portfolio  is  defined  to  be  the  expected  value  of  the  random  portfolio  return  and  the 
‘risk”  is  quantified  by  the  variance  of  the  random  portfolio  return.  Markowitz  showed  that,  for  a 
specified  lower  bound  on  the  return  ,  the  optimal  portfolio  can  be  computed  by  solving  a  convex 
quadratic  program. 
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The  analytical  tractability  of  the  Markowitz  mean-variance  model  led  to  development  of  the 
Capital  Asset  Pricing  Model  (CAPM)  for  asset  pricing  [35,  29,  23]  which  remains  one  of  the  most 
widely  used  models  for  equilibrium  asset  prices.  In  spite  of  its  theoretical  success,  the  practical 
impact  of  the  Markowitz  model  has  been  quite  limited  because  the  model  often  produces  “error- 
maximized  and  investment  irrelevant  portfolios”  [28].  This  behavior  is  a  manifestation  of  the  fact 
that  the  Markowitz-optimal  portfolio  is  extremely  sensitive  to  estimation  errors  in  the  parameters, 
i.e.  the  mean  and  variance  of  asset  returns,  and  often  amplifies  the  errors  several-fold.  A  num¬ 
ber  of  methods  have  been  proposed  for  mitigating  the  effects  of  parameter  uncertainty,  such  as 
constraining  the  portfolio  weights  [8,  14]  and  using  scenario  or  re-sampling  based  stochastic  pro¬ 
gramming  methods  [28,  40],  James-Stein  “shrinkage”  estimates  [9]  and  other  Bayesian  estimation 
techniques  [21,  13,  7,  39,  13,  32,  33,  22].  However,  these  approaches  do  not  explicitly  account  for 
parameter  uncertainty  in  the  portfolio  construction  step. 

To  model  the  effect  of  data  uncertainty  in  optimization  problems,  Ben-Tal  and  Nemirovski  [4,  3, 
5]  introduced  a  deterministic  framework  called  robust  optimization.  Using  this  approach,  Goldfarb 
and  Iyengar  [15]  proposed  a  robust  portfolio  selection  model  that  assumes  that  the  market  param¬ 
eters  lie  in  known  and  bounded  uncertainty  sets,  and  computes  a  portfolio  by  solving  a  max-min 
mean- variance  problem  that  assumes  the  worst  case  behavior  of  the  parameters.  The  uncertainty 
sets  in  [15]  correspond  to  confidence  regions  around  point  estimates  of  the  parameters;  consequently, 
there  is  a  probabilistic  guarantee  on  the  robust  portfolio’s  performance.  Moreover,  in  [15]  it  was 
shown  that  for  these  uncertainty  sets,  the  max-min  problems  can  be  reformulated  as  second-order 
cone  programs  (SOCPs).  Since  the  computational  complexity  of  an  SOCP  is  comparable  to  that  of 
a  convex  quadratic  program  of  similar  size  and  structure  [31,  24],  the  computational  effort  required 
to  compute  a  robust  portfolio  is  comparable  to  that  required  to  compute  the  Markowitz-optimal 
portfolio.  (See  [1,  24]  for  an  introduction  to  SOCP.  A  MATLAB  toolbox  to  solve  SOCPs  can  be 
downloaded  from  [36].)  In  the  computational  experiments  reported  in  [15],  the  performance  of  the 
robust  portfolio  was  generally  superior  to  that  of  the  non-robust  Markowitz  portfolio.  A  robust 
asset  allocation  model  with  different  uncertainty  sets  for  the  parameters  has  been  proposed  by 
Tiitiincii  and  Koenig  [38] 

Unlike  Bayesian  approaches  that  obtain  improved  estimates  of  the  mean  and  variance  of  asset 
returns  by  incorporating  uncertainty  into  the  underlying  asset  return  models,  robust  optimization 
based  approaches  model  the  mean  and  variance  to  be  uncertain  within  specified  uncertainty  sets. 
It  is  unclear  whether  either  of  these  approaches  is  unequivocally  superior  in  modeling  parameter 
uncertainty.  However,  the  robust  optimization  based  approaches  have  significant  computational 
advantage  since  the  associated  optimization  problem  scales  very  gracefully  with  the  problem  size 
and  one  can  add  many  convex  constraints  such  as  bounds  on  the  portfolio  holdings,  constraints  on 
the  portfolio  beta,  constraints  on  the  transaction  costs,  limits  on  the  “distance”  of  rebalanced  port¬ 
folios,  without  affecting  the  complexity  of  solving  the  portfolio  allocation  problem.  Furthermore, 
when  the  uncertainty  sets  are  confidence  regions  around  the  maximum-likelihood  estimates  for  the 
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problem  parameters,  the  robust  approach  provides  probabilistic  guarantees  on  the  performance  of 
the  computed  portfolios. 

The  goal  in  active  portfolio  management  is  to  beat  a  given  benchmark  by  using  information  that 
is  not  broadly  available  in  the  market.  In  this  paper  we  show  how  to  employ  robust  optimization 
techniques  to  significantly  improve  the  performance  of  active  portfolio  management.  The  main 
contributions  of  this  paper  are  as  follows. 

(a)  We  show  how  to  incorporate  robustness  with  respect  to  parameter  perturbations  into  mean- 
variance  models  for  active  portfolio  management.  Active  portfolio  management  attempts  to 
outperform  a  given  market  index  by  carefully  trading  in  assets  that  are  priced  incorrectly,  i.e. 
purchasing  undervalued  assets  and  short-selling  those  that  are  over-valued  (see  [17]  and  §2  for 
details).  Since  errors  in  estimating  the  returns  of  assets  are  expected  to  have  serious  conse¬ 
quences  for  an  active  strategy,  robust  models  are  likely  to  result  in  portfolios  with  significantly 
superior  performance.  The  results  of  our  numerical  experiments  with  both  simulated  and  real 
market  data  clearly  illustrate  the  benefits  of  using  a  robust  model.  The  models  proposed  in  [15] 
cannot  be  directly  applied  to  active  portfolio  management  because  the  performance  measures 
relevant  for  active  investing  are  always  relative  to  the  given  index. 

(b)  We  show  how  a  very  large  class  of  piecewise  convex  trading  cost  functions  can  be  incorporated 
into  an  active  portfolio  selection  problem  in  a  tractable  manner.  Since  active  portfolio  strategies 
tend  to  execute  many  trades,  properly  modeling  and  managing  trading  costs  are  essential  for 
the  success  of  any  practical  active  portfolio  management  model  [25].  Our  transaction  cost 
models  are  motivated  by  a  study  by  Loeb  [25]  where  it  is  shown  that  transaction  costs  are  a 
function  of  the  amount  traded  and  the  market  capitalization  of  the  traded  assets. 

(c)  We  show  that  side-constraints  such  as  net-zero  alpha  (see  §3.1)  and  investor  views  can  be  easily 
incorporated  into  the  robust  mean-variance  active  portfolio  selection  problem. 

(d)  We  propose  alternative  data-driven  models  for  active  portfolio  management  that  minimize 
Value-at-Risk  and  Conditional- Value-at-Risk  with  respect  to  a  given  benchmark.  We  also 
show  how  to  incorporate  robustness  with  respect  to  both  parametric  and  non-parametric  per¬ 
turbations  into  these  data-driven  models. 

The  organization  of  this  paper  is  as  follows.  In  §2  we  introduce  a  robust  mean- variance  model 
for  active  portfolio  management.  In  this  section  we  discuss  a  robust  factor  model  for  the  asset 
returns  (see  §2.1),  a  piece- wise  convex  trading  cost  function  that  can  be  calibrated  from  data  (see 
§2.3)  and  show  that  the  associated  robust  portfolio  selection  problem  can  be  reformulated  as  an 
SOCP  (see  §2.4).  In  §3  we  discuss  how  to  incorporate  side  constraints  such  as  net-zero  alpha  and 
investor  views  into  a  robust  mean-variance  active  portfolio  selection.  In  this  section,  we  also  discuss 
data-driven  non-parametric  models  for  active  portfolio  management.  In  §  4  we  report  the  results  of 
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our  computational  experiments  with  simulated  and  real  market  data  and  in  §  5  we  conclude  with 
a  brief  discussion  of  avenues  for  future  research. 

2  Robust  Mean- Variance  Active  Portfolio  Management 

In  this  section  we  propose  a  robust  mean- variance  model  for  active  portfolio  management.  Our 
model  uses  historical  returns  and  equilibrium  expected  returns  predicted  by  the  CAPM  to  identify 
assets  that  are  incorrectly  priced  in  the  market. 

There  is  a  fundamental  inconsistency  between  the  CAPM  and  active  portfolio  management. 
The  CAPM  assumes  that  markets  are  efficient,  i.e.  if  one  were  to  project  the  random  return  ra  of 
any  asset  on  the  random  return  r &  of  the  benchmark  to  obtain  linear  relation 

E  [ra\  =  aa  +  PaE[rb], 

the  expected  exceptional  return  aa  is  identically  equal  to  zero.  In  contrast,  active  management  is 
predicated  on  the  assumption  that  there  exist  assets  for  which  aa  ^  0  and  that  these  exceptional 
returns  can  be  predicted.  There  is  an  ongoing  debate  on  the  validity  of  the  efficient  market  hypoth¬ 
esis  and  the  CAPM  (see  [12]  and  the  references  therein)  that  is  beyond  the  scope  of  this  paper.  In 
this  paper  we  take  a  data  driven  approach  and  investigate  whether  historical  returns  can  be  used 
to  outperform  a  given  index. 

Active  portfolio  management  consists  of  two  steps.  The  first  step  is  to  forecast  the  vector  of 
exceptional  returns  a.  We  are  not  aware  of  any  statistically  justifiable  model  for  estimating  a. 
Practitioners  appear  to  employ  an  ad-hoc  combination  of  techniques,  such  as  asset  pricing  models, 
security  and/or  sector  analysis  and  analyst  views,  to  arrive  at  an  estimate  of  a  [17].  We  propose 
combining  historical  returns  with  the  CAPM  to  estimate  a.  The  second  step  is  to  use  the  estimates 
of  a  to  compute  a  portfolio  </>  that  outperforms  the  market.  Treynor  and  Black  [37]  introduced  a 
mean-variance  model  for  computing  such  a  portfolio.  This  model  shifts  the  market  portfolio  toward 
(away  from)  the  stocks  that  have  positive  (negative)  “alpha”.  Black  and  Litterman  [7,  19]  refined 
this  model  by  introducing  uncertainty  in  the  market  parameters.  Cvitanic  et.  al.  [10]  introduced 
a  dynamic  model  that  also  considers  hedging  demands.  In  this  paper,  we  propose  using  a  robust 
maximum  information  ratio  to  manage  an  active  portfolio. 

2.1  A  robust  factor  model  for  residual  returns 

We  use  the  robust  factor  model  proposed  by  Goldfarb  and  Iyengar  [15]  for  modeling  the  residual 
returns.  We  begin  by  projecting  the  random  return  vector  r  6  M”  of  the  n  assets  in  the  market  on 
the  benchmark  return  r 5  to  obtain  the  linear  relation 

r  =  (3rb  +  Ar,  (1) 
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where  (3  =  ((3 1, . . . , /3n)T 


t  =  cov(^) 
var(rb) 

denotes  the  “beta”  of  asset  i,  and  Ar  denotes  the  vector  of  residual  or  exceptional  returns.  In 
practice  (3  is  determined  by  applying  a  linear  regression  to  daily  asset  and  benchmark  returns 
and  r®  over  a  specified  historical  period  (i.e.,  sequence  of  trading  days,  t  =  1, . . .  ,  T). 

We  assume  that  the  benchmark  return  r b  ~  J\T(pbi  a and  the  random  residual  return  Ar  is 
given  by  the  factor  model 

Ar  =  a  +  V'f  +  e,  (3) 

where  a  =  E[Ar]  denotes  the  vector  of  expected  residual  returns,  f  ~  Af(0,F)  £  Mm  (m  <<  n) 
denotes  the  returns  on  the  m  factors  driving  the  market,  V  £  Mm><n  denotes  the  factor  loading 
matrix  of  the  n  assets,  and  e  r\j  AA(0,D)  £  Rn  is  independent  of  f.  The  notation  x  A /■(/*,£) 
denotes  that  x  is  a  multivariate  Normal  random  variable  with  mean  vector  p  and  covariance  matrix 
E.  The  covariance  matrices  F  >-  0  and  D  =  diag(d)  >-  0.  (The  notation  A  >-  0  denotes  that  A  is 
positive  definite.)  Thus,  Ar  ~  V'FV  +  D). 

In  practice  the  parameters  a,  F,  V,  and  D  in  the  factor  model  (3)  are  estimated  from  limited 
historical  data  and  are,  therefore,  subject  to  estimation  noise.  To  deal  with  these  estimation  errors, 
we  assume  that  the  expected  residual  return  a,  the  factor  loading  matrix  V  and  the  covariance 
matrix  D  of  the  residual  returns  are  uncertain,  with  uncertainty  sets  given  by  the  confidence  regions 
around  the  maximum-likelihood  estimates  of  a,  V  and  D,  respectively.  Thus,  the  individual 
diagonal  elements  di  of  D  lie  in  an  interval  [di  —  5j,di  +  <5,;] ,  i.e.  D  belongs  to  the  uncertainty  set 
Sd  given  by 

Sd  =  {D  :  D  =  diag(d),  di  =  di  +  A  di,  \Adi\  <  Si,  i  =  1,  ...,n}  ,  (4) 

and  the  uncertainty  set  Sv  for  the  factor  loading  matrix  V  is  of  the  form 

5,  =  {V:  V  =  V0  +  W,  ]|Wj||g  <  pj,  z  =  l,...,nj,  (5) 

where  W,  denotes  the  i— th  column  of  W  and  ||w||y  =  x/w'Gw  denotes  the  elliptic  norm  of  w 
with  respect  to  a  symmetric  positive  definite  matrix  G,  and  the  uncertainty  set  for  the  expected 
residual  return  vector  a  is  of  the  form 

a  £  Sa  =  {a  :  a  =  a0  +  £,  |&|  <  r/,:,  i  =  1, ...,  n}  .  (6) 

Methods  for  computing  cto,  Vi  Vo,  G,  p,  d  and  <5  from  historical  data  are  discussed  in  Appendix  A. 
For  a  detailed  justification  for  using  confidence  regions  as  uncertainty  sets,  see  [15].  For  simplicity, 
we  assume  that  the  factor  covariance  matrix  F  is  certain  but  this  assumption  can  also  be  relaxed 
and  all  of  the  methods  proposed  in  this  paper  can  be  extended  to  this  case;  see  [15]  for  details. 
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2.2  A  robust  portfolio  selection  model  with  transaction  costs 

Suppose  cf>  G  denotes  the  current  portfolio,  i.e.  c fa  is  the  amount  in  dollars  invested  in  asset  i. 
In  this  section  we  propose  an  optimization  model  that  computes  a  new  re-balanced  portfolio  cf>. 

Let  z,  y  >  0  denote,  respectively,  the  dollar  amounts  of  assets  bought  or  sold.  Then  the  resulting 
rebalanced  portfolio  <f>  £  W1  is  given  by 

0  =  0  +  z-y.  (7) 

Let  T(z,y)  denote  the  cost  incurred  by  the  trade  (z,y).  Then  we  must  have 

lr4>  +  T(z,y)  =  l1  4>,  (8) 

where  1  £  Mn  denotes  a  vector  with  every  component  equal  to  1.  From  (7)  and  (8)  it  follows  that 
T(z,  y)  =  l'(y  —  z).  Note  that  we  have  ignored  consumption  and  fresh  inflow  of  capital.  This  can 
be  easily  modeled  by  defining  a  suitable  consumption  function. 

We  assume  that  the  transaction  cost  function  T(z,y)  is  continuous,  increasing,  and  convex  in 
(z,y).  (  In  §  2.3  we  discuss  a  particular  specification  of  the  transaction  cost  function.)  In  order  to 
control  the  amount  of  the  transaction  costs  incurred  we  impose  the  constraint, 

T(z,y)  =  l'(y  -  z)  <  0w,  (9) 

where  9  £  [0, 1]  is  a  parameter,  and  w  is  the  amount  of  the  total  wealth  invested  in  the  rebalanced 
portfolio  cf>,  i.e.  the  wealth  after  the  trade  has  been  executed  and  the  transaction  costs  are  paid. 
In  addition,  we  impose  upper  and  lower  bounds  on  the  portfolio  holdings  as  a  percentage  of  the 
total  wealth,  i.e. 

— wv  <  cf)  <  wu  (10) 

A  positive  lower  bound  v  >  0  implies  that  short  sales  are  allowed. 

Next,  we  select  a  suitable  objective  for  the  portfolio  re-balancing  step.  The  goal  of  active 
portfolio  management  is  to  outperform  an  index  by  investing  in  assets  are  mispriced  with  respect 
to  the  benchmark  cf)b,  i.e.  assets  for  which  the  expected  excess  return  cx%  ^  0.  When  the  excess 
return  Ar  =  0  the  active  manager  would  hold  cf)  =  wcf>b ,  where  cf>b  is  the  normalized  (1  'cf)b  =  1) 
index  portfolio.  On  the  other  hand,  when  Ar  /  Owe  expect  an  active  portfolio  manage  to  hold  a 
portfolio  cf)  wcf>b.  Let 

if)  =  cf)  -  wcf)b,  (11) 

denote  the  active  component  if)  of  the  portfolio  cf>,  i.e.  if)  is  the  difference  between  the  portfolio 
cf>  and  the  passive  portfolio  wcf)b  which  invests  the  entire  net  wealth  w  in  the  market  portfolio  cf>b. 
The  random  return  r 'cf)  of  the  portfolio  cf>  can  be  decomposed  as  follows 

r 'cf>  =  (Av+rbf3)' cf>  =  Ar ' cf>  +  {(3' cf))rb  =  Ar'</>  +  (/3'i/))rb  +  wrb(P'cf)b)  =  Ar1  cf)+ ((3' i/))rb  +  wrb ,  (12) 
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where  we  have  used  (1)  and  (11)  and  the  fact  that  Ar '4>b  =  (r  —  rbf3)'<f>b  =  rb  —  rb/3' (f)h  =  0.  In 
the  active  portfolio  selection  literature,  the  component  ((3'if)rb  that  is  perfectly  correlated  with 
the  random  benchmark  return  rb  is  called  the  return  from  benchmark  timing  [17],  i.e.  the  return 
derived  by  carefully  timing  the  purchase/sale  of  the  benchmark.  Since  such  timing  is  likely  to  be 
very  sensitive  to  market  data,  the  term  is  very  often  negative.  To  protect  against  this,  most 
active  portfolio  selection  strategies  choose  if  to  be  uncorrelated  with  the  benchmark  return  rb,  i.e. 
i 3' if  =  0  or  equivalently 

(3'<f  =  w.  (13) 

Since  Ar  is  assumed  to  be  a  multivariate  Normal  vector,  (12)  and  (13)  imply  that  the  probability 
that  the  active  return  falls  short  of  the  benchmark  return  rj,  is  given  by 


P  (r '(f)  <  wrb )  =  P(Ar'0  <  0)  =  1  —  3>(IR(0)), 


(14) 


where  $(•)  denotes  the  cumulative  density  function  (CDF)  of  a  univariate  standard  Normal  random 
variable  and 

E[Ar  '(f>]  of  cf 


IR(0)  = 


(15) 


V'variAr7^]  ^'(V'FV  +  D)0‘ 
denotes  the  information  ratio  of  the  portfolio  <f>  [17].  The  information  IR(0)  is  the  Sharpe  Ratio 
of  the  exceptional  returns  of  the  portfolio  <f>. 

If  the  parameters  a,  V,  F  and  D  are  known  perfectly,  then  a  natural  objective  for  an  active 
portfolio  manager  is  to  maximize 

Vote)  =  nt(0),  (16) 


as  first  suggested  by  Treynor  and  Black  [37]  since  this  would  minimize  the  probability  of  shortfall. 

When  the  parameters  are  uncertain,  a  natural  objective  vr(<f)  is  to  maximize  the  worst  case 
IR ((f),  i.e. 

„,.(«)  =  (17) 

Combining  (7),  (9),  and  (10),  with  a  relaxed  version  of  (8),  we  have  that  the  set  of  feasible  <f  £  M” 
is  given  by 


$  = 


0  : 


w  —  1  '<f>  =  0,  w  —  (H'cj) 

T0  +  T(z,y)  <  T0, 


0  0  +  y  -  z  =  0, 


— wv  <  0  <  wu,  z,  y,  w  >  0. 


l'(y  -  z)  <  0w,  | 


(18) 


That  is,  there  exist  z,  y  £  M”  and  w  £  M+  in  addition  to  0  £  Mn  that  satisfy  the  constraints  in 
(18).  Note  that  by  relaxing  the  constraint  l'<f>  +  T(z,y)  =  T0,  we  ensure  that  $  is  convex.  In 
Appendix  B  we  show  that  this  relaxation  does  not  result  in  any  loss  of  generality. 

Thus,  a  non-robust  optimal  active  portfolio  is  any  optimal  solution  of  the  optimization  problem 
max{uo(0)  :  0  £  <h}.  Since  the  objective  uo(0)  of  this  problem  has  the  form  of  a  Sharpe  ratio, 
the  problem  is  often  referred  to  as  the  mean-variance  active  portfolio  selection  problem.  A  robust 
optimal  active  portfolio  is  any  optimal  solution  of  the  optimization  problem  max{ur(0)  :  0  £  4?}. 
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2.3  Transaction  cost  model 


In  this  section,  we  discuss  a  specific  form  for  T(z.y)  that  is  quite  general  and  yet  allows  one  to 
solve  the  active  portfolio  selection  problem  very  efficiently.  Following  Loeb  [25]  we  assume  that  the 
transaction  cost  for  each  asset  is  primarily  due  to  the  effect  that  trade  have  on  the  price  of  the 
asset  and  hence,  depends  on  the  market  capitalization  of  the  asset  and  the  size  of  the  trade.  We 
are  implicitly  assuming  that  many  of  the  trades  of  the  active  portfolio  are  fairly  large. 

We  begin  with  a  relatively  simple  transaction  cost  function.  We  assume  that  T( z,  y)  is  separable 
along  assets,  i.e. 

n 

T(z,y  )=^Tj(zj,yj)  (19) 

i= 1 

where  Tj(z*,  yi),  i  =  1, . . .  ,n  is  the  transaction  cost  function  for  each  asset  i,  and  that  the  cost  of 
buying  and  selling  an  asset  i,  i  =  1, . . . ,  n  is  the  same.  Hence,  Tj(zj,  y* )  =  Ti(zi  +  y.j),  i.e.  Tj(z*,  y*) 
is  a  function  of  Zi  +  yi.  Furthermore,  we  assume  that  the  function 

Ti(x)  =  max{$iix,  (20) 


where  >  0.  Thus,  T,;(x)  is  a  continuous,  increasing,  and  piecewise  convex  function  with  two 

pieces: 


Ti{zi,yi ) 


tin(zi  +  yi),  if  Zi  +  yi  <  7Tj 

3 

#i2 (zi  +  yi)  2 ,  if  Zi  +  yi>  7 Ti 

1 


(21) 


where  the  breakpoint  7r*  satisfies  i?i2  =  $ii  7q 

The  specific  form  in  (21)  is  motivated  by  the  data  reported  in  [25].  Figure  1  displays  the  per 
unit  transaction  cost,  i.e.  t(x )  =  as  a  function  of  the  trading  block  size  x  for  the  three  different 
market  capitalizations  classes:  $0.5~$lbn,  $l-$1.5bn,  and  >  $1.5bn.  In  order  to  better  understand 
the  structure  of  the  function  t(x),  in  Figure  2  we  plot  ln(f(x))  as  a  function  ln(x').  It  is  clear  from 
this  plot  that  for  each  of  the  capitalization  classes,  a  piecewise  linear  function  of  ln(x')  of  the  form 


ln(t(x)) 


Ol  if  0  <  X  <  7T 

02  +  03ffi(x),  if  7T  <  X 


(22) 


where  aq,  a.2,  <13  and  7r  are  parameters,  provides  a  very  good  fit  for  ln(f(x)).  The  least  squares 
estimates  of  the  parameters  (01,02,03,77)  for  the  three  asset  classes  are  given  in  Table  1.  (The 
best-fit  curves  are  also  plotted  in  Figure  1  and  Figure  2.)  The  parameters  in  Table  1  imply  that 
for  assets  with  a  market  capitalization  greater  than  $1.5bn,  the  transaction  cost  function 

.  .  ,  f  0.0115s,  x  <  e5-3647  =  2  1  3.7271, 

T  x)  =  t(x)x  =  <  ,  ,, .. 

I  0.0012xL414°,  x  >  213.7271. 


Therefore,  the  function  in  (21)  is  a  good  approximation  for  transaction  cost  function  implied  by 
the  best-fit  curve.  While  the  data  in  [25]  is  from  1982,  it  is  reasonable  to  assume  that  the  struc¬ 
ture  of  transaction  cost  (i.e.,  continuous,  piecewise,  and  convex)  illustrated  by  the  data  would  be 


Figure  1:  Per-unit  transaction  cost  t(x)  as  a  function  of  market  capitalization  and  the  amount 
traded. 


Figure  2:  Logarithm  of  per-unit  transaction  cost  t{x)  as  a  function  of  the  logarithm  of  the  block 
size  x. 
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$0.5-$lbn 

$1— $1.5bn 

>  $1.5bn 

ln(7r) 

4.474 

4.9087 

5.3647 

a\ 

-3.9374 

-3.9634 

-4.4654 

a-2 

-5.8241 

-6.1360 

-6.6864 

03 

0.4217 

0.4426 

0.4140 

Table  1:  Coefficients  of  the  least  squares  fits  in  Figure  2 


appropriate  to  other  time  periods,  although  scales  and  numerical  values  might  change.  We  discuss 
more  general  transaction  cost  functions  later  in  this  section. 

Next  we  show  that  for  transaction  cost  functions  of  the  form  (21),  the  set  {z  £  R"  ,  y  £  R"  ,  r  £ 
R+|T(z,y)  <  r}  can  be  represented  as  a  collection  of  second-order  cone  (SOC)  constraints  of  the 
form 

||  Ax  +  b ||  <  c1  x  +  d, 

where  x  £  is  the  decision  variable,  the  norm  ||-||  is  the  Euclidean  norm,  and  all  other  quantities 
are  constants. 

From  (19)  and  (20),  it  follows  that 

n 

T(z,y)  <  t  ^Ti<r,  Ti(zi,yi)  =  max.{&n(zi  +  yi),'&i2(zi  +  yi)2}  <n,  i  =  l,...,n 
i= 1 
n 

^2  Ti  <  T  dn(zi  +  yf)  <  Ti ,  di2{zi  +  Vi)*  <  Ti,  i  =  1, . . . ,  n.  (23) 

i= 1 

3 

The  following  two  results  (e.g.,  see  ([5])  or  ([1])  show  that  a  constraint  of  the  form  Oaizi  +  yi) 2  <  r* 
is  equivalent  to  a  collection  of  the  SOC  constraints. 


Lemma  1  The  constraint  y/x\X2  >  X3  where  x\  >  0,  x'2  >  0,  X3  >  0  is  equivalent  to  the  SOC 
constraint 


2x3 

x\  —  X2 


<  X\+  X2,  x\  >  0,  X2  >  0,  X3  >  0. 


Proof:  Squaring  both  sides  of  the  norm  inequality  and  simplifying  yields  the  equivalence.  ■ 

3 

Lemma  2  The  set  of  constraints  dx^  <  t,  x  >  0,  t  >  0,  where  d  >  0  is  a  constant,  hold  if  and  only 
if  there  exists  ti  >  0  satisfying 

Vdx  <  t,\  <  \fx.  (24) 

3 

Proof:  Suppose  that  dx 2  <  t.  Then  t±  =  y/x  satisfies  (24).  On  the  other  hand,  suppose  x,t,  and 
t\  satisfy  (24).  Then  we  have 

dx2  <  t\t  <  x?t  =$■  dx 5  <  t. 
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Lemmas  2  and  1  together  with  (23)  imply  that 


0  : 


w  —  l'<p  =  0,  w  —  (3’cj)  =  0  4>  +  y  —  z  =  4>,  l'(y  —  z)  <  9w,  'j 
+  E"=i  n  < 

+  y*)  <  Tj,  i  =  1, . . .  ,n 

2y/¥i(zi  +  yi) 


Ti  —  Hi 


<Ti  +  Hi,  i  =  l,...,n, 

<Zi  +  yi  +  1,  i  =  1, . . .  ,n, 


*2 

Zi  +  Vi  -  1 

—rev  <  <p  <  wu,  z,  y,  k,  r,  w  >  0. 


This  reformulation  has  important  practical  implications  since  large-scale  SOCPs  can  be  solved  very 
efficiently. 

We  now  present  a  more  general  class  of  transaction  cost  functions  that  still  allow  the  portfolio 
selection  problem  to  be  reformulated  as  an  SOCP.  We  still  assume  that  the  transaction  costs  are 
separable  by  assets;  however,  we  now  allow  for  the  transaction  costs  of  buying  and  selling  an  asset 
i  to  be  different;  i.e. 

n 

T(z>  y)  =  Ti~  +  Tr(y*)- 

i=  1 

All  the  transaction  costs  are  assumed  to  have  the  following  general  form: 


T(x) 


max 

i  <j<J 


{J2ajkxpjk}- 


(25) 


where  for  all  j,  k,  ajk  >  0,  and  pjk  >  1  and  rational.  The  transaction  cost  function  in  (25)  is  fairly 
general,  e.g.  functions  of  the  form 


T{x) 


a n®,  x  <  b i, 

4 

<  a2ix  +  0,22X3,  h<x<b2, 

.  CL31X2,  x  >  b-2, 


belong  to  the  allowed  class  of  transaction  cost  functions.  However,  it  does  not  cover  the  class  of  all 
piecewise  polynomial  convex  functions. 

The  constraint  T(x)  <  r  if,  and  only  if,  there  exist  variables  Tp.,  k  =  1, . . . ,  Kj ,  j  =  1, . . .  J, 
such  that 

Kj 

ajkXPjk  <  Tjk,  ^2  Tjk  <T  3  =  •  •  •  >J- 

k=  1 

a 

The  following  result  shows  that  constraints  of  the  form  9x~b  <  t  for  a  >  b  >  1  and  integer  can  be 
reformulated  as  a  collection  of  second-order  cone  constraints. 
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Lemma  3  ([5])  Let  a  and  b  be  positive  integers  with  a  >  b.  The  set  of  constraints  fixt  <  t,x  > 
0 ,t  >  0,  where  t?  >  0  is  a  constant,  hold  if  and  only  if  there  exists  nonnegative  variables  tki, 
k  =  1, . . . ,  q,  1  =  1,...,  2q+1~k  where  q  is  the  smallest  positive  integer  such  that  a  <  2q,  satisfying 


^b/2 


><7 

X 

< 

\ftq\tq2, 

tkl 

< 

\ftk-l,2l-ltk-l,2h  k  = 

=  2,... 

hi 

= 

-cT 

T— 1 

II 

'■—O 

-f-T 

hi 

= 

x,  l  =  b  +  1, . . . ,  2q  — 

a  +  b, 

hi 

= 

1,  l  =  2q  -  a  +  b+1, 

...,2q 

Proof:  The  proof  is  similar  to  that  of  Lemma  2  but  more  tedious.  See  [5]  page  105-107  for 

details  of  the  construction.  Note  that  we  defined  q  sets  of  new  (dummy)  variables  and  assigned 
specific  values  to  the  level  1  variables,  tu,  l  =  1, . . . ,  2q.  Note  also  that  the  above  construction  is 
not  unique;  there  are  many  ways  to  combine  variables  at  each  level.  When  a  =  b,  the  constraint 
fix's  =  fix  <  t  is  linear.  ■ 

2.4  SOCP  reformulation  of  the  active  portfolio  selection  problem 

The  robust  active  portfolio  selection  problem  is  given  by 

v*  =  maximize  vr(<f>)  =  nun{aeSaiVaeSv>DaeSd}  j  ^^+0)0}  ’  (26) 

subject  to  </>  £  <h, 

The  results  in  the  previous  section  imply  that  for  transaction  cost  functions  of  the  form  (25)  the 
constraint  set  <J>  can  be  expressed  as  a  collection  of  SOC  constraints.  In  this  section,  we  use  results 
in  [15]  to  show  that  when  v*  >  0  the  optimization  problem  (26)  can  be  reformulated  as  an  SOCP. 
Define  homogeneous  extension  of  <h  as  follows: 

<1  =  cl({(^,C):C>0 


f 

w  —  1'4>  =  0,  w  —  f3'<f>  =  0  <f>  +  y  —  z  =  C0, 

\ 

cl 

< 

lV+cnc-^cv^a'^ 

> 

I  —  wv  <  4>  —  wu,  l'(y  —  z)  <  9w, 

V 

/ - 

JS3 

S 

V 

0 

N. _ 

/ 

Note  that  (T(£~1z,  (~1y)  is  a  convex  function  of  (z,  y,  ()  for  f  >  0  whenever  T  is  a  convex  function. 
Moreover,  for  transaction  cost  functions  of  the  form  discussed  in  the  previous  section  the  constraint 
(T((~1z,  ( _1y)  <  r  reduces  to  a  collection  of  SOC  constraints.  For  example,  when  T  is  given  by 
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the  simple  two-term  cost  function  (21), 


$=  < 


(0,0  : 


w  —  l/0  =  0,  w  —  (3'<f>  =  0  0  +  y  —  z  =  £0,  l'(y  —  z)  <  0u>,  ^ 

i'0  +  £-LiA<a'0, 

Oa{zi  +  yi)  <  n,  i  =  l,...,n 

2V»i(zi  +  yi) 


Tj  —  At  i 


<r,:  +  Ati,  i  =  l,...,n, 
<Zi  +  yi  +  C,  i  =  1,  ■  ■  ■  ,n, 


2  At  A 

Zi  +  Ui  -  C| 

—rev  <  0  <  wu,  z,  y,  At,  r,  w,  C,  >  0. 


(28) 


Lemma  4  Let 


v 


* 

r 


minimize  rnaxVa6Si!  ,D„  e,S',i}  {  0'  ( V'F  V  +  D)0}, 
subject  to  minceg5a{a,0}  >  1, 

(0,0  €0. 


T/ien  we  have  following  possibilities. 


(29) 


(a)  £2/)  is  infeasible:  v*  <  0. 

(b)  (29)  is  feasible:  Let  (0,0  denote  any  optimal  solution  of  (29).  Then  (  >  0,  v*  >  0  and  £ ~lcj) 
is  optimal  for  (26). 

Proof;  Suppose  v*  0.  Then  there  exists  0  such  that  i iiin^gjt,'  t  cy  0  =  £  >  0  and  (£  1 0,  0  is 
feasible  for  (29).  This  establishes  (a). 

Suppose  (0,  0  is  feasible  for  (29)  and  £  =  0.  Then  mincegsa  a! <f>  >  1  implies  that  0/0. 
Assume,  for  concreteness,  that  <F  is  given  by  (28).  The  proof  will  proceed  in  an  similar  manner  for 
general  <F.  From  the  constraints,  it  follows  that 

n 

0  <  W  =  l/0  =  —  ^2  Ti  —  0- 

i=  1 

Thus,  w  =  0  and  Tj  =  0,  for  all  i  =  1, . . .  ,n.  Therefore,  z  =  y  =  0;  hence,  0  =  y  —  z  =  0.  A 
contradition.  Thus,  we  must  have  £  >  0  for  all  (0,0  feasible  for  (29). 

Suppose  (0,5)  is  optimal  for  (29).  Then  (£)^x0  e  <h.  Therefore, 


v *  >  min 

{aeSa.VcGS^.D^eSd} 


min 

{ae5a,Vae5„,Dae5d} 


o'(C  0) 


(C-Vy^'FV  +  D)^-^) 


Oi'cj) 


0  (V'FV  +  D)0 


>0, 
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where  the  first  equality  follows  from  the  fact  that  the  objective  ur(0)  is  homogeneous  with  degree-0. 

Suppose  v*  >  1/y/d*.  Let  0  G  <f>  denote  any  optimal  solution  of  (26).  Since  v*  >  0,  it  follows 
that  (  =  min aesa  a' $  >  0-  It  is  easy  to  check  that  (C^1 0,  ()  is  feasible  for  (29)  and 


max  {(C_10)/(V/FV  +  D)(<f_10} 
vQeS„,DaeSd} 


( v *)2 


<  vl 


A  contradiction.  Therefore,  it  follows  that  v*  =  — and  (  *0  is  optimal  for  (26).  This  estab- 

v 

lishes  (b).  ■ 

In  practice,  we  rebalance  only  if  there  exists  a  new  portfolio  0  G  <f>  with  vr(<j>)  >  0.  Therefore, 
for  all  practical  purposes,  the  two  optimization  problems  are  equivalent.  It  is  clear  that  (29)  is 
equivalent  to 


minimize  i'  +  7 


subject  to  0V'FV0 

< 

v, 

for  all  V  G  Sv, 

0'D0 

< 

7, 

for  all  D  G  Sd 

q/0 

> 

1, 

for  all  ol  £  Sa. 

(0,0 

G 

4. 

From  the  definition  of  Sa  and  Sd  it  follows  that 


(30) 


Ol'4>  >  1,  V  OL  G  Sa  47  Q!q0  —  Vj  1 0|  >  1, 


47  a'0(j>  -  0  >  0,  0  >  -0, 


(31) 


where  rj  =  (771, . . . ,  r]n)',  and 
0'D0  <  7,  V  D  G  5d  47  07  diag(d  +  <5)0  <  7  47 


2  diag(d  +  <5)1/20 

1-7 


<1  +  7,  (32) 


where  <5  =  (Si, ,  Sn)'.  In  [15],  it  was  shown  that  for  fixed  portfolio  0  and  a  number  u.  0  V/FV0  < 
v,  for  all  V  e  Sv  if,  and  only  if,  there  exist  a,  4  >  0  and  h  G  R0  satisfying 


v, 

1 

Amax(H)  ’ 

(7  +  4,  (33) 

1  -  a\i  +  hi,  i  =  1, . . .  ,m, 

where  r  =  p'|0|,  QAQ'  is  the  spectral  decomposition  of  H  =  G_5FG~+  A  =  diag(A),  and 
g  =  Q/H2G2'V'o0.  From  (31),  (32)  and  (33),  it  follows  that  the  robust  maximum  information 
ratio  problem  (26)  is  equivalent  to  an  SOCP.  Since  the  non-robust  active  portfolio  selection  problem 
is  a  special  case  of  the  robust  problem  where  the  uncertainty  sets  are  all  singletons,  it  follows  the 
non-robust  problem  is  also  equivalent  to  an  SOCP. 


4  +  lh  < 
a  < 
2  r 


cr  —  4 

2  9i 

1  —  a\i  —  hi 


< 


< 
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The  fact  that  the  active  portfolio  selection  problems  are  SOCPs  has  important  theoretical  and 
practical  implications.  Since  the  computational  complexity  of  an  SOCP  is  comparable  to  that 
of  a  convex  quadratic  program,  it  follows  that  robust  active  portfolio  selection  is  able  to  provide 
protection  against  parameter  fluctuations  at  very  moderate  computational  cost.  Morever,  a  number 
commercial  solvers  such  as  MOSEK,  CPLEX  and  Frontline  System  (supplier  of  EXCEL  SOLVER) 
provide  the  capability  for  solving  SOCPs  in  a  numerically  robust  manner. 

3  Alternative  models  for  robust  active  portfolio  management 

3.1  Side  constraints  on  alpha 

In  §2.1,  we  assumed  that  the  random  exceptional  return  Ar  is  given  by  the  robust  factor  model 

Ar  =  a  +  V'f  +  e 

where  a  £  Sa,  V  £  Sv  ,  f  ~  ■A/"( 0,  F)  and  e  ~  A/^(0,  D),  and  the  sets  Sy  and  S ^  are  defined  in  (6), 
(5)  and  (4),  respectively.  In  §2.4  we  further  assumed  that  the  uncertain  parameters  (ct,  V,  D)  could 
be  independently  chosen  to  minimize  the  value  of  the  information  ratio  IR (</>).  This  assumption  leads 
to  very  pessimistic  estimate  of  the  performance  of  the  portfolio  0,  and  as  a  consequence  (29)  can 
become  infeasible  even  when  there  exist  portfolios  with  reasonable  active  returns.  This  pessimistic 
behavior  can  be  controlled  by  suitably  constraining  the  set  of  feasible  market  parameters.  In  this 
section,  we  survey  some  of  the  constraints  that  still  allow  the  portfolio  selection  problem  to  be 
recast  as  an  SOCP. 

3.1.1  Net  zero  alpha 

A  net  zero  alpha  constraint  requires  that  the  return  vector  a  satisfy  Xag/  =  Yliei  ao*  for  some 
set  of  indices  I  C  {1, . . .  ,n}  (e.g.  stocks  in  a  particular  industry).  Such  a  constraint  ensures  that, 
although  an  individual  Oj,  i  £  /,  may  differ  from  its  nominal  value  «o i-,  the  average  deviation  is 
equal  to  zero.  Moreover,  the  robust  constraint  minaesa  c/0  >  1  now  becomes 

a'04>  +  min  <  £'0  :  Y  &  =  °,  |£|  <  V  \  =  ao4>  ~  rH  W  ~  Y  rh  ^  ~  Al  -  L 

l  iel  J  igl  i£l 

where  A  is  a  new  decision  variable.  Thus,  net  zero  alpha  constraints  are  modeled  by  linear  con¬ 
straints. 

3.1.2  Incorporating  analyst  views 

We  propose  two  methods  for  incorporating  analyst  views  into  robust  active  portfolio  selection.  The 
first  relies  on  Bayesian  analysis.  The  bounds  rj  defining  the  uncertainty  set  Sa  were  set  using  the 
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confidence  level  lo  and  the  posterior  density 


a  =  A7(a0,  £),  (34) 

implied  by  historical  data.  Suppose,  in  addition,  analysts  provide  the  portfolio  manager  with 
private  views  about  the  future  a.  For  example,  the  analysts  may  believe  that  P(aj  —  aij  >  ir)  =  rj. 
This  view  can  be  expressed  by  setting 


OLi  —  ctj=K~\-  cr(, 

where  £  ~  Af(0, 1),  the  parameters  (k,  a)  are  chosen  to  satisfy  ^rA/'(^^1)  =  f?  and  J-j\r  denotes  the 
CDF  of  a  standard  Normal  random  variable.  (Note  that  there  is  some  flexibility  in  the  choice  of 
(k,  a).)  Thus,  if  the  portfolio  manager  receives  m  different  analyst  views  of  the  form  P(p/a  >7 t)  =  77, 
they  can  collectively  be  represented  as 


Pa  =  A7(k,  A).  (35) 

for  suitably  chosen  P  €  Mmxn,  k,  6  and  A  £  Mmxm.  From  (34)  and  (35)  it  follows  that  the 
posterior  density  of  a  given  both  historical  data  and  analysts’  views  is  given  by 

a  =  A f(a,  S),  (36) 

where  S  =  (XD1  +  A^1)-1  and  a  =  £)(S~1ao  +  P'A^1^).  From  (36),  it  follows  that  the  co- 
confidence  region  for  a  is  given  by  s’ 

Sa  =  {a  :  (a  -  a)'E_1(a  -  a)  <  ^(w)}, 

where  xn  denotes  the  CDF  of  a  y2  random  variable  with  n  degrees  of  freedom.  Thus,  the  robust 
constraint  min^gs^la'c/)}  >  1  is  equivalent  to  the  SOC-constraint 

A>-X“1(^)l|S50||  >  1. 

This  method  of  incorporating  views  is  identical  to  that  used  in  the  Black-Litterman  model  [7]  - 
with  the  added  feature  of  using  second-order  information  to  set  confidence  levels. 

In  practice,  however,  it  may  be  difficult  to  elicit  the  vector  k  and  the  covariance  A.  Our  second 
method  assumes  that  analysts’  views  are  of  the  form 

l  <  p'a  <  u 


or  equivalently 


/  u  +  l 
pa  =  — —  +v. 


v 


< 


u  —  l 
2 
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Suppose  the  manager  receives  m  analysts’  views.  These  can  be  expressed  as  |Pa  —  k\  <  A,  where 
P  £  Mmxn  and  k,  A  £  Mm.  Coupling  this  constraint  with  the  bounds  |ct  —  ckqI  <  r]  implied  by 
historical  data  results  in  the  uncertainty  set 

Sa  =  {a  :  | ck  —  ao|  <  Vi  |Pa  —  k\  <  A}, 

=  {a  :  a  =  ao  +  u,  Pu  —  v  =  k  —  Pao,  |u|  <  ij,  |v|  <  A}.  (37) 

This  uncertainty  set  is  completely  defined  by  setting  the  center  k  and  the  bounds  A.  We  believe  that 
eliciting  these  two  parameters  would  be  significantly  easier  than  eliciting  the  covariance  matrix  A. 
From  (37),  it  follows  that  the  robust  constraint  minQ,e,sct{a:/0}  >  1  is  equivalent  to 

+  (P«o  -  «)' 7  ~v'\ <t>~  P't|  -  |7|  >  li 

where  7  is  a  new  decision  variable.  It  is  easy  to  see  that  the  above  constraint  can  be  linearized, 
and,  therefore,  can  be  incorporated  into  an  SOCP. 

3.2  Data  driven  methods 

All  the  methods  discussed  in  §2.2  and  §3.1  are  parametric  methods  in  the  sense  that  return  data 
is  assumed  to  be  distributed  according  a  known  distribution  but  with  uncertain  parameters.  In 
this  section,  we  describe  data-driven  robust  methods  where  we  do  not  make  any  parametric  as¬ 
sumptions  about  the  return  distribution.  In  the  first  subsection  below,  we  consider  non-parametric 
shortfall  minimization.  In  the  subsequent  subsections,  we  introduce  robustness  into  the  shortfall 
minimization  problem  and  discuss  tractability  issues. 

3.2.1  Non-parametric  shortfall  minimization 

Let  :  k  =  1  and  { B ^  :  k  =  1,...,IV}  denote,  respectively,  the  historical  data 

on  the  return  r  £  Rn  of  the  n  assets  in  the  market  and  the  return  of  the  benchmark.  Recall 
that  in  §2.2  we  argued  that  a  natural  objective  for  an  active  portfolio  manager  is  to  minimize  the 
shortfall  probability  P(r '(j)  <  wr^)  (see,  e.g.  (14)).  Since  we  only  have  the  raw  historical  data,  we 
approximate  the  true  distribution  P  by  the  empirical  measure  P.  We  have  N  samples 

lk(ct>)  =  -  (DW)V>,  k  =  1, . . .  ,N,  (38) 

for  the  random  shortfall  of  the  portfolio  0.  Therefore,  the  active  manager’s  optimization  problem 
reduces  to 

minimize  ^=1  X  (7 k (0)) , 

subject  to  0  £  <!>, 

where  x(')  denotes  the  indicator  function  of  the  positive  axis,  i.e.  x(x)  =  1  f°r  x  ^  0  aRd  zero, 
otherwise.  The  optimization  problem  (39)  is  equivalent  to  a  mixed  0-1  SOCP  with  one  knapsack 
constraint,  and  hence,  is  not  a  convex  problem. 
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The  value-at-risk  (VaRe(0))  of  a  portfolio  0  at  probability  e  is  defined  as 

VaRe(0)  =  inf  {P((r&l  —  r )'0  >  x)  <  e}. 

It  follows  that  P((r&l  —  r )'0  <  0)  <  e  if,  and  only  if,  VaRe(0)  <  0.  Thus,  (39)  can  be  interpreted 
as  maximizing  the  probability  e  for  which  VaRe(0)  <  0. 

We  next  show  how  to  construct  a  tractable  convex  approximation  for  (39).  Since  x(x)  < 
(vx  +  1)+  for  all  v  >  0,  it  follows  that 

TV  TV 

<  jj + x)  (4°) 

k= 1  k= 1 

Let  0  =  v(j).  Then  (0,u)  €  $  (see  (27)).  From  (40)  it  follows  that 

minimize  £  ZZi  (7 fcWO  +  l)  +  ,  ^ 

subject  to  (0,  v)  e  <3?, 

is  a  convex  approximation  of  the  active  manager’s  problem.  Since  P((r&l  —  r);0  >  0)  <  E(u(r;,l  — 
r )' <f)  +  1)+,  it  follows  that 

E((r(,l  —  r)'(j)  +  n_1)+  —  v~1e  <  0  (42) 

implies  that  P((r&l  —  r)'0  >  0)  <  e,  or  equivalently,  VaRe(0)  <  0.  Since  the  left  hand  side  of  the 
expression  in  (42)  is  positive  for  t  <  0,  it  follows  that 

inf  1 1  +  -E((rf,l  —  r )'<f>  —  t)+|  —  0, 

implies  that  VaRe(0)  <  0.  Rockafellar  and  Uryasev  [34]  have  shown  that 

CVaR e(4>)  =  E[(rf,l  —  r )'<f>  |  (r&l  —  r)'<p  >  VaRe(0)]  =  inf  1 1  +  -E((rf,l  —  r •)'(/>  —  f)+|  •  (43) 

From  (43)  and  (42)  it  follows  that  CVaRe(0)  is  a  conservative  convex  approximation  for  VaR€(0) 
and  that  (40)  can  be  interpreted  as  maximizing  the  probability  e  for  which  CVaRe(0)  <  0.  Ne- 
mirovski  and  Shapiro  [30]  show  that  the  CVaR  is,  in  fact,  the  tightest  convex  approximation  of  the 
VaR.  Unlike  VaR,  CVaR  is  a  coherent  risk  measure,  and  is  becoming  popular  in  risk  management 
applications.  Thus,  the  sample  approximation  (40)  is  an  important  approximation  problem  in  its 
own  right. 

Let  0 *  denote  the  portfolio  that  minimizes  the  true  CVaR  (note  that  0*  is  not  computable  in 
any  practical  sense).  Let  0  denote  the  optimal  solution  of  the  sample-based  CVaR  optimization 
problem  (40).  Then  two  important  question  emerge: 

(i)  How  close  is  the  sample  approximation  of  the  CVaR  to  the  true  CVaR  of  the  portfolio  0? 

(ii)  How  close  is  the  CVaR  of  0  to  the  CVaR  of  the  true  optimal  0*? 
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Both  of  these  questions  can  be  answered  using  results  from  statistical  learning  theory  and  the 
Vapnik-Chervonenkis  dimension.  See  [20]  for  details. 

A  loss  function  p  that  maps  real  valued  random  variables  X  to  real  numbers  is  called  a  Chocquet 
loss  function  if  there  exists  a  non-decreasing  convex  function  v  :  [0, 1]  e- >  [0, 1]  such  that 


1.  v(0)  =  0  and  u(l)  =  1 

2.  p(X)  =  fy  F^itydvft),  where  Fx  denotes  the  CDF  of  the  random  variable  X. 

Since  the 

f1  dt  r1 

CXaRe(X)  =  E[X  :  X  >  VaRe(A)]  =  /  F~l{t)—  =  /  F^^dvit) 

J l-e  e  Jo 

for  v(t)  =  i(t  —  1  +  e)+,  it  follows  that  CVaRe  is  a  Choquet  loss  function. 

Let  =  (r^l  —  r )'cf>  denote  the  random  return  of  the  portfolio  <j>  below  the  benchmark  rj, 
and  let  p  be  any  Choquet  loss  function.  Then  another  natural  optimization  problem  for  the  active 
portfolio  manager  is  given  by 

minimize  p(  70),  ^ 

subject  to  0  G  $. 

The  sample  approximation  (40)  for  the  CVar  optimization  problem  can  be  extended  to  all  Choquet 
loss  functions  as  follows. 

p(Tp)  =  j  Fi ~*(t)dv(t), 

~  /  ^{t)dv{t), 

N 

=  9Jfc/[fc](0 ) 

k=  1 

where  Fr denotes  the  empirical  CDF  of  constructed  from  the  N  samples  7 (<j>)  =  (71  (^>), . . . ,  7iv(0))/ 
of  the  shortfall  (see  (38)), 

f  v((N-k  +  l)/N)  -v((N-  k-  1)/N),  k  =  1, 2, . . . ,  N  -  1, 

I  v(l /Ar),  otherwise, 

and  f[k](<t>)  denotes  the  sum  of  the  k  largest  terms  in  7 (4>).  Thus,  (44)  can  be  approximated  by 
the  following  sample-based  optimization 

^ ^  _  ni .  fr.  7 rh\ 

(45) 


minimize  YJk=i  %  f[k\  (0) , 
subject  to  cj)  G  <F 

Berstimas  and  Brown  [6]  use  linear  programming  duality  to  show  that 

f\k](<t>)  =  maximize  a7  7  ((f)),  =  minimize  kr ^  +  1'uW, 

subject  to  l'a  =  k,  subject  to  r^l  +  >  7(0), 

0  <  a  <  1,  u(fc)  >  0, 


(46) 
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Since  q  >  0,  the  dual  formulation  in  (46)  implies  that  (45)  is  equivalent  to  the  SOCP 


minimize  Ylk=i  Qk(kr^  —  l'u^), 
subject  to  1  —  —  7 (</>)  >  0, 

u(fc)>0,  k  =  l,...,N, 
<t>  G  <f>. 


(47) 


3.2.2  Choquet  loss  functions  with  parameter  uncertainty 

In  this  section,  we  use  results  in  §2.1  to  extend  the  above  data-driven  model  (47)  to  include 
parameter  uncertainty. 

From  (12),  it  follows  that 


(77I  —  r )'  (f>  =  Ar  '  (f>  =  (a  +  V'f )' <f> 


where  f  denotes  the  random  return  on  the  factors  driving  the  market.  Suppose  a  and  V  are 
uncertain  with  uncertainty  sets  Sa  and  Sv  respectively,  then  the  k- th  sample  7 k{4>)  =  (B^l  — 
D (k))'(j>  of  the  shortfall  of  the  portfolio  <f>  is  uncertain  with  the  uncertainty 

A  7fc  =  -Aa>  +  (fW)/AV<(), 

where  denotes  the  k- th  sample  of  the  factor  returns.  We  interpret  the  uncertainty  Ay*,  as 
follows:  7 k((/>)  is  a  sample  of  the  shortfall  in  the  past;  in  the  future,  however,  the  value  of  the 
parameters  a  and  V  may  shift  to  some  other  value  in  their  associated  uncertainty  sets  and  the 
“correct”  sample  of  the  new  shortfall  distribution  is  given  by  7 &(</>)  +  Ay^.  Therefore,  in  order  to 
protect  against  all  possible  parameter  shifts  one  should  solve  the  following  robust  version  of  (47) 

minimize  Ylk=i  Qkikr^  —  l'u^), 
subject  to  —  u^  —  7 j(4>)  —  (3  —  Vj  >  0, 
maxao+AaeSa{-Aa'())}  <  /?, 
maxvo+AVeS,, {(f )' A V</>}  <  uj, 
u(fc)>0,  k  =  l,...,N, 

(f>  e  $. 


j,  k  =  1,  ■  •  • ,  N, 
j  =  !,•••,  N, 


(48) 


Both  (47)  and  (48)  suffer  from  the  drawback  that  the  number  of  variables  in  the  problem  increasse 
as  0(N2),  where  N  is  the  number  of  samples.  Next,  we  describe  a  more  tractable  formulation  for 
a  restricted  class  of  Choquet  loss  functions. 

Let  vp(t)  =  l(t  —  1  +p)+.  Then  it  is  easy  to  check  that  any  non- decreasing  convex  function 
with  v(0)  =  1  —  v(l)  =  0  can  be  represented  as 


v(t) 


vp(t)m(dp) 
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where  m  denotes  a  probability  measure  on  [0,1].  Consider  the  restricted  class  of  Choquet  loss 
p  functions  such  that  the  corresponding  measure  mp  is  discrete,  i.e.  the  corresponding  convex 
function  vp  is  of  the  form 

s 

vP{t)  =  ^2msvPs(t) 

s= 1 

Since  the  loss  function  ps  corresponding  to  vPs(t)  is  CVaRPs,  it  follows  that  ps(4>)  =  inf t{t  + 
^jE((rftl  —  r)'<p  —  t)+,  and  the  loss  function  p  corresponding  to  vp  is  p  =  X^f=i  ?wsCVaRPs.  Thus, 
we  have  that  for  this  restricted  class  of  loss  functions  the  robust  problem  (48)  can  be  reformulated 
as  follows 

minimize  ]T)S=1  ms(3s, 

subject  to  ts  -  ^  Y,f=i  W  +  Vj  ~  ts)+  <  Ps, 

maxAaeSa{— Aa'0}  <  /?,  (49) 

7j(0)  +  maxAveS„{(f(j))'AV(/)}  <uj:  j  =  1, ...  ,N, 

4>  e  <f>. 

This  program  has  only  O(NS)  variables.  Thus,  this  formulation  become  much  more  attractive 
when  S  =  o(N). 

3.2.3  Choquet  loss  function  with  non-parametric  uncertainty 

In  this  section,  we  describe  robustness  with  respect  to  a  non-parametric  set  of  measures.  This 
measure  of  robustness  is  formulated  in  terms  of  the  following  Stackleberg  game  between  the  investor 
and  nature: 

1.  Investor  choose  a  portfolio  cf)  E  4>. 

2.  Nature  chooses  a  shortfall  distribution  such  that  the  corresponding  CDF  F  satisfies: 

(a)  sup3,eR|F(x)  —  F(x)  |  <  e,  where  F  denotes  the  empirical  distribution  implied  by  the 
vector  7 (cf)). 

(b)  F(Kmaxfc{7fc(</>)})  =  1. 

The  constraint  (b)  on  the  power  of  Nature  is  required  to  ensure  that  the  set  of  admissible  measures 
is  tight.  It  can  be  replaced  by  any  other  constraint  that  ensures  tightness.  The  Robust  CVaR 
optimization  problem  is  given  by 

maximize  max^ejr(^,)  E [Z  +  1)+,  ^ 

subject  to  tp  e  4>. 

where  Z  F. 
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Fix  </>  and  7  =  7 Let  7^)  denote  the  /c-th  order  statistic  of  the  vector  7.  Then  for 

x  £  [7(fc)>  7(fc+i))> 

|  F(x)  -  F(x)  |=|  F(x)  -  k/N  |<  max  {|  F(7(fe+1))  -  k/N  |,  |  F(7(fc))  -  k/N  |}  , 

where  F{x~ )  denotes  the  limit  from  the  left.  Thus,  it  follows  that 

sup  | F{x)  -  F{x) |  =  max^  {|  E(7(-+1))  -  k/N  |,  |  F( 7(fe))  -  fc/lV  |  J  .  (51) 

For  any  fixed  F  £  we  have  that 

N 

E(Z  +  1)+  <  ^  7(fc)  (^(7(fc))  -  ^(7(fc-i)))  +  (1  +  e)l{N)  (^(7(JV+i))  -  ^(7(at)))  ,  (52) 

fc=i 


where  7(0)  =  —00  and  7nv+i)  =  k1(N),  and  the  equality  holds  only  if  F  puts  all  the  probability 
mass  on  the  set  (70.)  :  k  =  1, . . . ,  N}  U  {k7(at)}.  From  (51)  and  (52),  it  follows  that 


max  E (Z  +  1)+  = 
Fer{  i/>) 


maximize  EfcLi(7(fc)  +  l)+Pk  +  («7(iV)  +  1)+Pn+i, 


subject  to 


E*=i  Pi -k/N 

eS1**  =  1, 

p  >  0. 


<  e,  fc  =  l,...,lV, 


For  e  <  1,  it  is  easy  to  check  that  the  optimal  solution  of  this  LP  is  given  by 

1  N 

F™  )  E(Z  +  1)+  =  (w  ~  £)  (7(m)  +  1)+  +  N  ^  (7(fc)  +  1)+  +  €(K7(iV)  +  1)+I 


k=m-\- 1 


where  m 


|\ZVe~|.  Thus,  the  robust  active  portfolio  selection  problem  (50)  is  equivalent  to 

maximize  ($  -  e)  (Ek=m(7(k)W  +  x)+)  + 

(e  -  (EfeLm+i(7(fe)  +  !)+)  +  e(K7(JV)(V>)  +  1)+, 

subject  to  tf)  £  <F. 


(53) 


Let  7fe(V>)  =  (7fc(*/0  +  !)+-  Then  Ef=m 7(fc) (V»)  +  1)+  =  J2k=ml(k)  is  the  sum  of  the  N-m  +  1 
largest  terms  in  the  vector  7(i/>).  Thus,  the  dual  formulation  for  the  sum  of  k  largest  terms  (see  (46)) 
it  follows  that  (53)  is  equivalent  to 


maximize  (^  —  e)  ((N  —  m  +  1)tT)  + 

(e  -  ((N  -  ?n)r(1)  +  l'uTi)  +  en, 

subject  to  >  7 j{4>)  +  1,  k  =  1,  2,  j  =  1, . . . ,  N, 

r«  +  uf  >0,  A:  =  1,2,  j  =  l,...,N, 

7T  >  «7j(0)  +  1,  j  =  l,...,At, 

7T  >  0,  U  >  0,  ■?/>  £  l>. 


(54) 
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DJIA  Dow  Jones  Composite  Average 
NDX  Nasdaq  100 
RUT  Russell  2000 


Table  2:  Base  set  of  factors 

4  Computational  Experiments 

In  this  section  we  report  the  results  of  our  numerical  experiments  with  the  portfolio  selection  model 
proposed  in  §  2.  We  conducted  two  sets  of  experiments.  The  first  set  of  experiments  compared  the 
performance  of  a  robust  active  portfolio  management  strategy  with  that  of  a  non-robust  version 
of  that  strategy  on  simulated  market  data.  The  results  of  this  experiment  are  presented  in  §  4.1. 
The  second  set  of  experiments,  described  in  §  4.2,  compare  the  performance  of  the  robust  and  non- 
robust  strategies  on  real  market  data.  In  these  experiments  we  do  not  use  the  alternative  models 
described  in  §3. 

In  both  sets  of  experiments  we  assumed  that  the  transaction  costs  were  given  by  (19)  and  (21), 
and  that  the  parameters  of  the  transaction  cost  functions  T,  (zj,  yt)  for  all  assets  i  were  the  same, 
i.e. ,  i3il  =  'd  and  7Tj  =  n  for  i  =  1, . . . ,  n.  To  set  6  and  ir  we  relied  on  the  data  in  [25],  which  was 
from  1982.  That  data  suggests  that  $  =  0.01  and  the  break  point  n  corresponds  to  $0.25mn  for  the 
large-cap  stocks.  We  assumed  that  the  parameter  d  is  the  same  today.  However,  since  the  price 
of  the  S&P  500  has  increased  approximately  10  times  from  119.15  on  August  3rd,  1982  to  1059.02 
on  November  3rd,  2003,  we  assumed  that  the  break  point  n  also  increased  10  times.  Hence,  we 
set  7 r  =  $2.5mn.  We  assumed  our  current  wealth  to  be  w  =  $100nm.  We  do  not  argue  that  these 
values  model  reality  exactly  -  estimating  the  exact  transaction  cost  function  is  beyond  the  scope 
of  this  paper.  However,  these  transaction  costs  are  reasonable  in  the  light  of  the  data  in  [25]. 

4.1  Experiments  on  simulated  data 

For  this  set  of  experiments  the  number  of  assets  n  was  set  to  n  =  200  and  the  number  of  factors  m 
was  set  to  m  =  38.  The  linear  model  given  by  (1)  and  (3)  implies  that  the  market  is  defined  by  the 
factor  covariance  matrix  Fs,  the  factor  loading  matrix  Vs,  the  covariance  of  the  residual  returns 
Ds,  the  “betas”  of  the  assets  (3s ,  the  mean  return  /i][  of  the  benchmark,  the  volatility  us  of  the 
benchmark  and  the  mean  exceptional  return  vector  ets.  The  superscript  “s”  indicates  that  these 
parameters  were  set  at  the  beginning  of  the  simulation  and  not  randomly  generated.  To  ensure  that 
the  simulated  assets  returns  were  realistic  we  used  the  following  strategy  to  select  these  parameters. 
We  randomly  selected  35  stocks  from  the  constituents  of  the  S&P  500  on  January  2,  2004  and  the 
3  major  market  indices  in  Table  2  and  deemed  them  to  be  the  factors  driving  the  market.  We 
set  the  factor  covariance  matrix  Fs  equal  to  the  (38  x  38)-covariance  matrix  estimated  using  the 
past  300  days.  Each  element  of  the  matrix  Vs  and  the  (3s  was  independently  sampled  from  a 
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jV(0, 0.5)  distribution.  We  set  the  simulated  benchmark’s  mean  daily  return  /if  =  6.5  x  10~4  and 
the  daily  volatility  as  =  0.01.  Each  diagonal  element  of  DA  was  sampled  from  a  uniform  distribution 
over  the  interval  [10^6, 10-4].  The  components  of  the  vector  cG  were  sampled  from  a  jV(0,  0.002) 
distribution.  As  noted  above,  the  samples  for  the  market  parameters  were  generated  once  at  the 
beginning  of  the  simulation  and  were  held  fixed  through  the  entire  simulation  run.  Given  the 
market  parameters,  the  daily  return  r -0  e  Mn  of  the  n  =  200  assets  is  given  by 

rW  =N{ns,Y.s),  (55) 

where 

ps  =  fj,bj 3 s  +  cG,  =  alPs(ps)'  +  (VA)/(F5')VS  +  Ds 

The  details  of  the  simulation  experiment  is  described  below. 

(a)  We  re-balanced  the  portfolio  every  T  =  60  trading  dates.  Thus,  we  bought  and  held  portfolios 
for  approximately  a  quarter.  Our  investment  horizon  was  9  periods. 

(b)  The  investor  only  observes  the  market  return  data,  i.e.  in  particular,  the  investor  does  not  know 
the  market  parameters  and  in  particular  the  factors  driving  the  market.  At  the  beginning  of 
each  investment  period: 

•  We  computed  an  estimate  S  of  the  covariance  matrix  of  asset  returns  using  the  market 
data  from  the  previous  H  =  5  periods,  i.e.  H  x  T  =  300  days. 

•  We  assumed  that  the  factors  for  the  particular  period  were  the  three  standard  indices  in 
Table  2,  the  benchmark  and  the  eigenvectors  corresponding  to  the  largest  eigenvalues  of 
£  accounting  95%  of  Tr(E).  Note  that  this  set  of  factors  is  different  from  the  true  set  of 
factors  driving  the  market. 

•  We  estimated  the  vector  /3  from  the  history  of  H  x  T  days. 

•  We  computed  the  estimate  F  of  the  factor  covariance  matrix  using  the  simulated  factor 
returns  from  the  history  of  H  X  T  days. 

•  We  set  the  confidence  level  u>  =  99%  and  computed  the  parameters  Qo,  r/,  Vo,  G,  p,  d, 
and  S  using  the  procedure  described  in  Appendix  A. 

(c)  We  set  the  parameters  6  =  0.2,  the  upper  bound  u  =  0.11  and  the  lower  bound  v  =  0.011. 
See  (18)  for  the  definition  of  9,  u  and  v. 

(d)  The  non-robust  portfolio  q ^  for  period  p  was  set  equal  to  an  optimal  solution  of 

maximize  vq  (cj>) 

^  (p) 

subject  to  <f>  G  &{4>0  ), 
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where  $p\i)  =  (1  +  r|p  ^)0oP  ^(i),  i  =  1, . . . ,  n,  rf  1')  is  the  return  on  asset  i  over  the  period 

(p)  —  —  (p) 

p,  3>(0O  )  denotes  that  the  portfolio  0  defining  <h  (see  18)  was  set  equal  to  0O  . 

Similarly,  the  robust  active  portfolio  was  set  equal  to  an  optimal  solution  of  the  robust  problem 

maximize  vr((f>) 

- —  (p) 

subject  to  0  €  <F(0,r  )i 

where  (j>lP\i)  =  (1  +  rlP~^  )(plP~1\i) ,  i  =  1, . . . ,  n. 

Both  the  non-robust  active  strategy  and  the  robust  active  strategy  started  with  an  initial  wealth 
=  $100mn.  In  the  first  period  we  assumed  that  transaction  costs  were  zero  -  this  ensured 
that  we  did  not  have  to  specify  an  initial  portfolio  0  to  compute  0*'1-*. 


(e)  The  results  reported  are  the  averages  over  50  independent  runs. 


We  summarize  our  numerical  results  in  a  series  of  plots  highlighting  different  performance 
measures.  Let  denote  the  wealth  of  a  portfolio  strategy  at  the  end  of  day  t.  Then 

n  t 

=  II  (f  +  Lf  +  rf0))^,  (p-l)T<t<pT,  (56) 

i= 1  k=(p—  1)T+1 


where  the  risk-free  rate  of  return  ry  =  3%  per  year,  denote  the  return  in  excess  of  the  risk-free 
rate  ry  on  asset  i  on  day  k ,  and  0^  is  the  portfolio  held  by  the  portfolio  strategy  in  period  p.  The 
relative  daily  wealth  rw ^  of  a  strategy  with  respect  to  the  index  B  at  the  end  of  day  t  is 


it)  - 


w 


(*) 


run  '  = 


w 


(t) 


where  denotes  the  cumulative  wealth  generated  by  investing  the  initial  wealth  =  $100mn 
in  the  benchmark. 

Figure  3  displays  the  relative  wealth  generated  by  the  robust  active  strategy.  In  this  plot  the 
solid  line  is  the  relative  wealth  averaged  over  50  simulation  runs,  the  box  at  each  period  is  the 
one  standard  deviation  interval,  and  the  vertical  lines  show  the  max  and  min  values  of  the  relative 
cumulative  wealths  in  that  period.  For  comparison,  the  average  relative  wealth  generated  by  the 
non-robust  strategy  is  depicted  by  a  dotted  line.  Figure  4  displays  the  results  for  the  non-robust 
strategy. 

Figures  5  and  6  plot  the  per-period  excess  return  Rp  of  the  robust  active  strategy  and  the 
non-robust  strategy  respectively,  where 


r(p)  = 


w 


{Tv) 


W 


C TP ) 


W{T(P- 1))  ...(Tfp- 1)) 

LXJ  r? 


i.e.  R ^  is  the  return  of  the  portfolio  strategy  in  excess  of  the  benchmark  return. 
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Robust-Relative  Wealth 


Figure  3:  Relative  wealth  of  the  robust  strategy 


Meanvar-RELATIVE  WEALTH 


Figure  4:  Relative  wealth  of  the  non-robust  mean-variance  strategy 
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Robust-Per  Period  Excess  Return 
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Figure  5:  Per  period  excess  return  of  the  robust  strategy 
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Figure  6:  Per  period  excess  return  of  the  non-robust  mean-variance  strategy 
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Robust-Percentage  turnover 


Figure  7:  Turnover  of  the  robust  portfolios 
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Figure  8:  Turnover  of  the  non-robust  mean-variance  portfolios 
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Figures  7  and  8  plot  the  portfolio  turnover  n  of,  respectively,  the  robust  and  the  non-robust 
strategies,  where 

i^-"i 

Note  that  the  percentage  turnover  can  be  greater  than  1  because  of  short  sales.  The  following 
observations  are  supported  by  the  results  displayed  in  Figures  3-8. 

(i)  The  robust  active  portfolio  strategy  is  always  above  the  benchmark,  i.e.  it  consistently  pro¬ 
vides  exceptional  returns.  This  is  not  true  for  the  non-robust  mean- variance  active  portfolio 
strategy  -  the  minimum  wealth  of  this  strategy  is  often  much  below  that  of  the  benchmark 
in  every  period! 

(ii)  The  minimiLm  wealth  of  the  robust  portfolio  strategy  in  any  period  is  higher  than  that  average 
wealth  of  the  non-robust  portfolio  strategy. 

(iii)  The  return  of  the  robust  portfolio  strategy  is  more  stable  in  the  sense  that  ratio  of  the 
minimum  to  the  maximum  wealth  in  any  period  is  larger  than  the  same  ratio  for  the  non- 
robust  strategy. 

(iv)  The  turnover  of  the  robust  strategy  and  the  non-robust  strategy  are  roughly  comparable  on 
average,  but  the  turnover  of  the  non-robust  strategy  is  more  variable  and  can  be  quite  large. 

(v)  The  average  number  of  assets  in  the  robust  portfolio  was  109.4  and  the  number  of  assets  in  the 
classical  mean-variance  portfolio  was  177.6.  Therefore,  on  average,  the  total  cost  of  managing 
the  robust  strategy  ought  to  be  lower. 

Since  the  robust  strategy  is  protecting  against  all  parameters  in  the  uncertainty  set,  one  would 
expect  that  the  robust  strategy  is  likely  to  perform  better  in  markets  where  the  parameters  fluctuate. 
The  next  set  of  experiments  test  this  hypothesis  by  modifying  the  daily  asset  returns  as  follows: 

r  =  X®),  with  probability  1  — 

r  =  A f(ji,  X),  with  probability 

where  /is  =  (3fnbS  +  a®  and  Xs  =  (j%/3s(/3s)t  +  (VS)'(FS)! Vs)  +  Ds  denote  the  mean  and  variance 
of  the  usual  simulated  returns,  the  shifted  mean  return 

fii  =  (f  -  0.1sign(/rf))/rf,  i  =  l,...,n, 

and  the  shifted  covariance  matrix  X  =  (j%/3s(/3s)t  +  (Vy(Fs)(V)  +  D,  with  Vij  =  Vij(  1  +  z)  and 
and  z  ~  A7(0,0.01).  Note  that  in  this  new  market  model,  with  probability  ?,  the  expected  return 
jli  shifts  in  a  direction  opposite  to  that  of  the  nominal  return  /if. 

Figures  9  and  10  plot,  respectively,  the  relative  wealth  of  the  robust  and  non-robust  strategy 
for  £  =  0.2,  i.e.  when  the  returns  deviate  from  the  regular  return  model  with  20%  probability. 
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Robust-Relative  Wealth 
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Figure  9:  Relative  wealth  of  the  robust  strategy  when  ?  =  0.2 

Again,  the  results  are  averaged  over  50  independent  simulation  runs.  Figures  11  and  12  plot  the 
excess  return  of  the  two  portfolio  strategies  for  =  0.2  and  Figures  13,  and  14  display  the  turnover 
of  the  two  portfolio  strategies.  When  =  0.2  the  average  number  of  assets  held  by  the  robust 
strategy  and  the  non-robust  strategies  were  86.06  and  181.50,  respectively.  These  plots  support  the 
following  observations. 

(i)  The  robust  strategy  outperforms  the  non-robust  strategy;  however,  the  excess  returns  of  both 
strategies  are  lower  than  their  respective  returns  in  a  stationary  market. 

(ii)  The  turnover  of  the  robust  strategy  is  significantly  higher  than  the  non-robust  strategy.  This 
fact,  together  with  the  observations  that  the  robust  strategy  holds  fewer  assets  and  has  a 
higher  excess  return,  lends  to  the  hypothesis  that  the  robust  strategy  is  better  able  to  react 
to  market  shifts. 

4.2  Real  Market  Data 

In  this  section  we  report  the  results  of  computational  experiments  with  real  market  data.  The 
universe  of  assets  in  these  set  of  experiments  was  the  stocks  that  were  part  of  the  S&P  500  index 
and  the  benchmark  was  the  S&P  500  index.  Thus,  the  goal  of  the  active  strategies  was  to  provide 
returns  in  excess  of  the  S&P  500  index.  The  strategies  invested  over  the  period  April  1,  2001  to 
November  10,  2003  using  data  starting  from  January  5,  2000. 

The  experimental  procedure  was  very  similar  to  the  one  described  in  the  previous  section. 
However,  in  contrast  to  the  simulation  runs,  here  we  have  a  single  sample  path. 
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Figure  10:  Relative  wealth  of  the  non-robust  strategy  when  ?  =  0.2 
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Figure  11:  Per  period  excess  return  of  the  robust  strategy  when  q  =  0.2 
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Mean  var-Per  Period  Excess  Return 
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Figure  12:  Per  period  excess  return  of  the  non-robust  strategy  when  ?  =  0.2 
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Figure  13:  Turnover  of  the  robust  portfolios  when  <;  =  0.2 


32 


RELATIVE  WEALTH 
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Figure  14:  Turnover  of  the  non-robust  portfolios  when  =  0.2 
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Figure  15:  Relative  wealth  of  the  strategies 
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percent  of  portfolio  Sharpe  Ratio 


Realized  Sharpe  Ratios 


Figure  16:  Realized  sharpe  ratios  of  the  strategies 
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Figure  17:  Percentage  turnover  of  the  portfolios 
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Figure  15  plots  the  relative  wealth  of  the  strategies  with  respect  to  that  of  S&P  500  and 
Figure  17  shows  the  percentage  turnover  of  the  portfolios.  Figure  16  displays  the  realized  Sharpe 
ratio  of  the  two  portfolio  strategies  and  the  S&P  500  index.  The  average  number  of  assets  held  in 
the  portfolios  were  72.2  for  the  robust  strategy  and  413.4  for  the  non-robust  strategy.  Thus,  the 
non-robust  strategy  hold  pretty  much  all  the  stocks  in  the  index. 

Since  the  results  above  refer  to  a  single  sample  path,  there  is  a  chance  that  they  suffer  from  a 
starting-point  bias.  The  next  set  of  results  attempt  to  correct  this  bias.  We  considered  N  =  60 
different  strategies  that  each  re-balance  every  T  =  60  trading  days  but  each  have  different  start 
date.  Thus,  a  strategy  that  starts  investing  on  day  d,  d  =  0, . . . ,  N  —  1,  re-balances  the  portfolio 
on  day  60fc  +  d,  k  >  0.  Let  w^\d)  denote  the  wealth  on  day  t,  t  >  1,  of  the  portfolio  strategy 
that  starts  investing  on  day  d  and  let  wtB(d )  denote  the  cumulative  total  wealth  on  day  t  when  the 
initial  investment  of  $100mn  is  invested  in  the  S&P  500  index  on  day  d.  If  the  S&P  500  index  has 
a  stationary  distribution.  Then  the  sequences  of  relative  wealth 


rw^\d) 


w(t+d\d) 

dPW 


t  >  0, 


corresponding  to  each  starting  date  d  are  independent  and  identically  distributed.  Therefore,  one 
can  treat  these  different  sequences  as  independent  investment  runs.  In  Figure  18  the  solid  lines 
correspond  to  the  average 

I  JV-i 

—  rw(t\d ) 

d=0 

for  each  of  the  two  strategies,  and  the  box  at  each  period  is  the  one  standard  deviation  interval  for 
the  relative  wealth  of  the  robust  strategy,  and  the  vertical  lines  show  the  max  and  min  values  of  the 
relative  wealths  in  that  period  for  the  robust  strategy.  Figure  19  displays  the  analogous  quantities 
for  the  non-robust  strategy.  From  these  plots  it  is  clear  that  the  robust  strategy  is  superior  to 
the  non-robust  strategy  and  the  insights  from  the  simulation  model  continue  to  hold  for  the  real 
market  data. 


5  Conclusion 

In  this  paper  we  show  how  to  use  robust  optimization  based  techniques  to  immunize  the  active 
portfolio  selection  problems  to  perturbations  in  parameter  values.  In  §2  and  §3  we  discuss  a 
number  of  different  models  for  active  portfolio  management  and  show  to  make  them  robust  to 
data  perturbations.  We  show  that  the  portfolio  selection  problem  in  all  of  these  models  can  be 
reformulated  as  an  SOCP.  We  also  develop  a  piece-wise  convex  model  for  trading  costs  which 
allows  a  great  deal  of  modeling  flexibility  without  increasing  the  computational  cost  of  solving  the 
portfolio  selection  problem  -  they  still  remain  SOCPs.  This  fact  has  important  theoretical  and 
practical  implications.  Since  the  computational  complexity  of  an  SOCP  is  comparable  to  that 
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Figure  18:  Relative  cumulative  wealth  of  the  robust  strategy  with  respect  to  the  S&P  500 
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Figure  19:  Relative  cumulative  wealth  of  the  non-robust  strategy  with  respect  to  the  S&P  500 
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of  a  convex  quadratic  program,  it  follows  that  robust  active  portfolio  selection  is  able  to  provide 
protection  against  parameter  fluctuations  at  very  moderate  computational  cost.  Morever,  a  number 
commercial  solvers  such  as  MOSEK,  CPLEX  and  Frontline  System  (supplier  of  EXCEL  SOLVER) 
provide  the  capability  for  solving  SOCPs  in  a  numerically  robust  manner. 

There  are  several  interesting  extensions  that  are  worth  exploring.  The  models  in  this  paper  are 
all  single-period  myopic  models.  Extending  the  portfolio  selection  model  to  a  multi-period  setting 
appears  to  be  non-trivial. 
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A  Defining  the  uncertainty  sets 

Suppose  the  market  data  consists  of  asset  excess  returns  {r^  :  t  =  1,...,T},  benchmark  returns 
{r^  :  t  =  1,  ...,T},  and  corresponding  factor  returns  {f^  :  t  =  1,  ...,T}  for  T  trading  days.  After 
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computing  (3  by  linear  regression  and  forming  Ar^  =  for  i  =  1, . . . ,  n  and  t  =  1, . . . ,  T, 

the  linear  model  (3)  implies  that 

n 

A  rf]  =  ai  +  ^2vjiff) +ef\  i  =  l,...,n,  t  =  l,...,T.  (57) 

j=i 

In  linear  regression  analysis,  typically  it  is  assumed  that  {e^  :  i  =  1  =  1  are 

all  independent  normal  variables  and  ~  A/"(0, of},  for  all  t  =  1  The  independence 

assumption  is  relaxed  in  AR.MA  models  at  the  cost  of  replacing  the  least-squares  estimation  by 
Kalman  filters  (see  [18]). 

Let  B  =  [f\f2,...,fT]  £  R,mxCC  be  the  matrix  of  factor  returns.  Collecting  together  terms 
corresponding  to  a  particular  asset  i  over  all  periods  t  =  1, . . . ,  T,  we  get  the  following  linear  model 
for  the  returns  {rf'>  :  t  =  1, . . .  ,T}, 

y  i  =  Ax,  +  e,, 


where 


y,  =  [  A rj  Arf  ' ' '  A rj  ]',  A  =  [  1  B'  ],  x,  =  [  c*  Ui  ■  ■  ■  Ku  ]' 

/i\  (rTH'\ 

and  €i  =  [e\  , . . . ,  e)  is  the  vector  of  residual  returns  corresponding  to  asset  i.  The  least-squares 
estimate  x,:  of  the  true  parameter  x,  is  given  by 

x,  =  (A'A)_1A,yi, 

if  rank(A)  =  m  +  1.  Substituting  y,  =  Ax,  +  e,,  we  get 

Xi  -  x,  =  (A,A)_1A/ei  ~  M(0,  X), 

where  X  =  erf  (A' A)-1.  Let  Q  G  R,Jx(m+1).  A  standard  result  in  regression  theory  states  that  if 
of  in  the  definition  of  X  is  replaced  by  s2,  where  .sf  is  the  unbiased  estimate  of  of  and  given  by 

„2  _  l|y* —  Axj||2 

rri  -I  1 

1  —  m  —  1 

then 

v  =  -T(Qjq  -  Qx,:)/(Q(A,A)~1Q/)_1(Qx,  -  Qx,) 

■’  s% 

is  distributed  according  to  the  F-distribution  with  J  degrees  of  freedom  in  the  numerator  and 
T  —  m  —  1  degrees  of  freedom  in  the  denominator  [2,  16];  i.e.,  the  probability  v  <  cj(lu)  is  o>,  or 
equivalently, 

P ((Qx,:  -  Qxi),(Q(A'A)'1Q7)_1(Qxi  -  Qx,:)  <  Jcj(to)s2)  =  ui  (58) 
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If  we  define  Q  =  [e2,  e3, . . . ,  em+i]  £  Rmx(TO+i),  then  Qx,  =  V,;  is  the  least  squares  estimate  to 
the  true  factor  loading  Qx,  =  V,  and  using  (58)  we  get 

P((Vi  -  Vi)'(Q(A'A)-1Q')-1(Vi  -  V,)  <  mcm{u)sfj  =  u  (59) 

Therefore,  the  set  Sv(u ) 

Sv(u)  =  {V  :  V  =  V0  +  W,  || Wj||g  <Pi,i  =  1, n|  , 

where 

V0  =  V,  G  =  (Q(A'A)-1Q/)-1  =  BB'  -  -L(B1)(B1)',  Pi  =  i  =  l,...,n, 

is  an  wn-confidence  set  for  the  factor  loading  matrix  V. 

To  construct  the  uncertainty  set  for  expected  residual  return,  define  Q  =  e^.  Then,  Qx,;  =  a, 
and  Qx;  =  on  are  the  true  expected  residual  return  of  asset  i  and  the  least  squares  estimate  of  the 
expected  residual  return,  respectively.  Therefore,  (58)  implies 

P(jd;  -on\<\J  (A/A)nciMsf)  =  w  (60) 

Since  the  residual  returns  e,  are  assumed  to  be  independent,  it  follows  that 

Sa(u)  =  {«  :  a  =  a0  +  £,  |^|  <  rn,i  =  1, ...,  n}  ,  (61) 

where 

ao,i  =  an,  Vi  =  \J (A'Ajj^ci  (w)s|,  i  =  1,  ...n, 
is  an  con  confidence  region  for  the  mean  return  (x.  Observing 

P e  Saiw)  X  sv(u))  =  1-P((/*,V)  £Sa(u)  x  Sv(u)), 

>  l-P((n)tSa(u)-P(VtSv(u;)), 

=  2 ojn  -  1, 

we  conclude  that  the  Cartesian  product  S{u)  =  SQ( u>)  x  Sv(lo)  is  a  joint  conhdence  region  of  (a,  V) 
with  ( 2con  —  1)  confidence. 

One  way  of  constructing  the  set  Sd,  is  to  use  a  bootstrap  tu-confidence  interval  [11]  around  each 
of.  In  our  formulation  we  used  the  mean  residual  variance  di,  for  each  i,  for  simplicity. 

B  Convex  relaxation 

Suppose  cj)  £  <3?,  where  is  defined  in  (18)  and  1/0  +  T(z,y)  <  V0.  We  show  below  that  there 
exists  a  portfolio  0  £  <1>  that  satisfies 

l'0  +  T(z,y)  =  l'4>.  (62) 
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with  v0(<p)  =  v0(<f>)  and  vr(<f>)  =  ty(0).  Hence,  relaxing  the  constraint  (62)  does  not  result  in 
any  reduction  in  the  optimal  objective  values  of  either  the  robust  or  non-robust  active  portfolio 
selection  problems. 

Define  a  new  solution  (0,z,  y,  0)  as  follows. 

0  =  s<j>, 

z  =  z  +  (s  —  1)  max{0,  0}, 

y  =  y  +  (s  -  l)max{-0,  0}, 

w  =  sw, 

where  the  max  operator  is  taken  component  wise.  Then 

1'0  +  T(z,y)  -  (1'0  +  T(z,y))  >  (s  -  1)1'0  +  \p'z  p'y\ 

=  (a  -  l)[w  +  (pz  +  py)'\4>\] 

where  p  =  (pz,  py)  is  a  subgradient  of  T(-,  •)  at  (z,y).  Since  T( z,y)  is  an  increasing  function, 
p  >  0  and  w  +  (pz  +  py)'\4>\  >  0.  Thus,  we  can  choose  an  s  >  1  such  that  (</>,z, y)  satisfies  (62). 

We  will  now  show  that  (<f>,z,y,w)  satisfies  the  remaining  constraints  defining  S.  First  note 
that 


0  —  z  +  y  =  s0  —  z  —  (s  —  1)  max{0,  0}  +  y  +  (s  —  1)  max{— 0,  0} 
=  s0  -  (s  —  1)0  -  z  +  y 
=  0  —  z  +  y  =  0. 


Also, 

l'(y  —  z)  =  l/(y  —  z)  —  (s  —  l)l'0  <  Ow  —  (s  —  l)w  <  Oiu  <  9w. 

Clearly  z  >  0,  y  >  0,  and  w  >  0,  and  all  the  remaining  constraints  defining  are  satisfied  because 
they  are  all  homogeneous  of  degree  1  in  0  and  w. 

Finally,  since  vq  and  vr  are  scale-invariant,  it  follows  that  v0(<f>)  =  uo(0)  and  vr{<j>)  =  vr{<f>). 
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